CNN-based and DTW features for human activity recognition on depth maps

نویسندگان

چکیده

Abstract In this work, we present a new algorithm for human action recognition on raw depth maps. At the beginning, each class train separate one-against-all convolutional neural network (CNN) to extract class-specific features representing person shape. Each class-specific, multivariate time-series is processed by Siamese multichannel 1D CNN or determine actions. Afterwards, nonzero pixels shape in map calculate statistical features. On of such Dynamic Time Warping (DTW) They are determined basis DTW distances between all training time-series. Finally, feature vector concatenated with vector. For category multiclass classifier, which predicts probability distribution labels. From pool classifiers select number that an ensemble built them achieves best classification accuracy. Action performed soft voting averages distributions calculated largest discriminative power. We demonstrate experimentally MSR-Action3D and UTD-MHAD datasets proposed attains promising results outperforms several state-of-the-art depth-based algorithms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Action Recognition with Image Based CNN Features

Most of human actions consist of complex temporal compositions of more simple actions. Action recognition tasks usually relies on complex handcrafted structures as features to represent the human action model. Convolutional Neural Nets (CNN) have shown to be a powerful tool that eliminate the need for designing handcrafted features. Usually, the output of the last layer in CNN (a layer before t...

متن کامل

Digit Recognition based on Euclidean and DTW

This paper describes the implementation of two isolated digit recognition techniques and is a comparison between the algorithms implemented. Any digit recognition comprises of mainly two stages feature extraction and similarity evaluation. Here, two feature extraction techniques, namely linear predictive cepstral coefficients (LPCC) and mel frequency cepstral coefficients (MFCC) are implemented...

متن کامل

Depth Images-based Human Detection, Tracking and Activity Recognition Using Spatiotemporal Features and Modified HMM

Human activity recognition using depth information is an emerging and challenging technology in computer vision due to its considerable attention by many practical applications such as smart home/office system, personal health care and 3D video games. This paper presents a novel framework of 3D human body detection, tracking and recognition from depth video sequences using spatiotemporal featur...

متن کامل

Face recognition based on depth maps and surface curvature

This paper explores the representation of the human face by features based on shape and curvature of the face surface Curvature captures many features necessary to accurately describe the face such as the shape of the forehead jaw line and cheeks which are not easily detected from standard intensity images Moreover the value of curvature at a point on the surface is also viewpoint invariant Unt...

متن کامل

Features and models for human activity recognition

Human Activity Recognition (HAR) is aimed at identifying current subject task performed by a person as a result of analyzing data from wearable sensors. HAR is a very challenging task that has been applied in different areas such as rehabilitation, localization, etc. During the past ten years, plenty of models, number of sensors and sensor placements, and feature transformations have been repor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neural Computing and Applications

سال: 2021

ISSN: ['0941-0643', '1433-3058']

DOI: https://doi.org/10.1007/s00521-021-06097-1